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Dear Josef and Michel, 

Just a little precision. I noticed that the link in the Newsletter doesn’t point toward my 
website ! 

«Find a collection of animations from Canada (Genevieve Savard, Montreal) 
http://www.seq.etsmtl.ca/Math/Animations/index.html » 

This website was created by Robert Michaud. 

My website is http://www.seq.etsmtl.ca/GSavard/index.html and the animation page is 
http://www.seq.etsmtl.ca/GSavard/Animations/index.html 

I wish you a very nice day, 

Genevieve 

Genevieve Savard 

MaTtre d'enseignement en mathematiques 
Ecole de technologie superieure 


Interesting and recommended websites: 

SeeLogo (APGS) is a computer language through which the user can create beautiful pictures, 
dynamic arts and make games. Here is a link to a book that uses the language to create mathe- 
matical art. 

http: www.ithaca.edu/seelogo/ 


You can find and download another free CAS-program CoCoA (in many languages). 

• CoCoA is a program to compute with numbers and polynomials. 

• It is free. 

• It works on many operating systems. 

• It is used by many researchers, but can be useful even for "simple" computations. 

http : / / cocoa . dima . unige . it/ 


Visual Interactive Tools for Advanced Learning: 

http : / / www . mathe-vital . de 

Bei MatheVital handelt es sich um eine modulare, frei zugangliche Sammlung interakti- 
ver Materialien fur den Unterricht in mathematiknahen Fachern. 

Interoperable Interactive Geometry for Europe (many languages) 
A new platform for Dynamic Geometry programs: 

http : / / i2geo . net 


Find more Links on page 3 
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Dear DUG Members, 

As I promised in the last DNL I'll give a report of the official DUG Meeting which 
was held at TIME 2010 in Malaga. Every four years the DUG-board must be elected. 
There are no changes in the board, all members accepted staying in our commission: 
Barbel Barzel, Josef <& Noor Bohm, Walter Klinger, Bernhard Kutzler and Josef 
Lechner (in alphabetical order). Thanks to all of you for your work in the past and 
much success for the next 4 years' period. 

We had many excellent talks and workshops in both Conference Strands. Unfortu- 
nately I could not attend so many of them because of giving my own lecture(s) and 
workshop or being occupied as chair of other sessions. So I am looking forward to 
browsing and studying the Conference Proceedings which should be ready soon. I' II 
keep you informed. (If you want to have a look to it in advance, you can go to 
http://www.time2010.uma.es/abtracts.pdf .) 

We had great keynotes. The picture shows 
Michel Beaudin talking about "Using the Real 
Power of Computer Algebra". I feel reminded 
on Hamlet with the ghost of Hamlet's father 
in the background. (It is our friend Terence 
Etchells who could not participate, so he 
appeared as a good ghost in during Michel's 
talk.) 

We all are indebted to the generous sponsors of the Conference: University of 
Malaga and some faculties, Authorities of Malaga and Antequera, Texas Instru- 
ments, Unicaja. Many thanks to you all. 

In this DNL you will not find so many articles as usual. The contributions of 
DNL#79 are very extended. Don Phillips provides a tool for Nonlinear Regression 
and 2-Stage Least Squares Regression and demonstrates in an impressive way that 
it is possible to transfer programs from DERIVE to the TI89/92/V200 and to TI- 
NspireCAS as well. 

My article on the Huff man-Code makes use of JohannWiesenbauer's tool from the 
last DNL for plotting binary trees. 

Please pay attention to the many links to excellent websites given in the Informa- 
tion page and on page 3. Thanks again to Michael de Villiers from South Africa for 
his valuable notes. 

It is a nice cooccurrence that Michel Beaudin wrote about cubics in the last DNL 
and we have another in this DNL request on the same issue. Btw there is an inter- 
esting paper on Cardano's formula in The Montana Mathematics Enthusiast, 2005, 
vol. 2. You can download it, see the links. 

Best regards as ever, 

Download all ZWZ.-Derive- and Tl-files from 

http : / / www . austromath . at/ dug/ 
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Contributions: 

Please send all contributions to the Editor. 
Non-English speakers are encouraged to 
write their contributions in English to rein- 
force the international touch of the DNL. It 
must be said, though, that non-English 
articles will be warmly welcomed nonethe- 
less. Your contributions will be edited but 
not assessed. By submitting articles the 
author gives his consent for reprinting it in 
the DNL. The more contributions you will 
send, the more lively and richer in contents 
the DERIVE & CAS-77 Newsletter will be. 


Next issue: December 2010 

Deadline 1 5 December 2010 

Preview: Contributions waiting to be published 

Some simulations of Random Experiments, J. Bohm, AUT, Lorenz Kopp, GER 

Wonderful World of Pedal Curves, J. Bohm 

Tools for 3D-Problems, P. Liike-Rosendahl, GER 

Financial Mathematics 4, M. R. Phillips 

Hill-Encription, J. Bohm 

Simulating a Graphing Calculator in DERIVE, J. Bohm 
Henon & Co, J. Bohm 

Do you know this? Cabri & CAS on PC and Handheld, W. Wegscheider, AUT 

An Interesting Problem with a Triangle, Steiner Point, P. Liike-Rosendahl, GER 

Overcoming Branch & Bound by Simulation, J. Bohm, AUT 

Diophantine Polynomials, D. E. McDougall, Canada 

Graphics World, Currency Change, P. Charland, CAN 

Cubics, Quartics - Interesting features, T. Koller & J. Bohm 

Logos of Companies as an Inspiration for Math Teaching 

Exciting Surfaces in the FAZ / Pierre Charland 's Graphics Gallery 

BooleanPlots.mth, P. Schofield, UK 

Old traditional examples for a CAS - what's new? J. Bohm, AUT 
Truth Tables on the Tl, M. R. Phillips 

Where oh Where is It? (GPS with CAS), C. & P. Leinbach, USA 

Embroidery Patterns, H. Ludwig, GER 

Mandelbrot and Newton with DERIVE, Roman Hasek, CZ 

Snail-shells, Piotr Trebisz, GER 

A Conics-Explorer, J. Bohm, AUT 

Practise Working with Times 

Tutorials for the NSpireCAS, G. Herweyers, BEL 

Some Projects with Students, R. Schroder, GER 

Dirac Algebra, Clifford Algebra, D. R. Lunsford, USA 

The PROOF, C. Leinbach & J. Bohm 

Treating Differential Equations (M. Beaudin, G. Piccard, Ch. Trottier) 
and others 

Impressum: 

Medieninhaber: DERIVE User Group, A-3042 Wiirmla, D'Lust 1, AUSTRIA 
Richtung: Fachzeitschrift 
Herausgeber: Mag. Josef Bohm 


The DERIVE-NEWSLETTER is the Bulle- 
tin of the DERIVE & CAS- 77 User Group. 
It is published at least four times a year 
with a contents of 40 pages minimum. The 
goals of the DNL are to enable the ex- 
change of experiences made with DERIVE, 
77-CAS and other CAS as well to create a 
group to discuss the possibilities of new 
methodical and didactical manners in 
teaching mathematics. 

Editor: Mag. Josef Bdhm 
D'Lust 1, A-3042 Wiirmla 
Austria 

Phone: ++43-06604070480 

e-mail: nojo.boehm@pgv.at 
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More Links which might be of interest for you: 

(Thanks to Michael de Villiers, who provided valueable links!) 


The Association for Mathematics Education of South Africa Congress 2010 Proceedings 

http : / / www . amesa . org . za/AMESA2 0 1 0 /Proceedings . htm 

Volume 1: Lectures (337 pages), Volume 2: Workshops & How I Teach (330 pages) 

Download free textbooks from bookboon: 

Do you need math help? In our free mathematics books you hopefully will find answers to 
your questions. These textbooks will guide you through mathematical concepts and models, 
and hopefully give you a better understanding. Topics such as limit value, linear optimization 
and the decay constant are explained. Specifically for computer science students, we pro- 
vide the book 'Mathematics for Computer Scientists'. (Announcement from bookboon) 

http : / /bookboon . com/ uk/ student/mathematics 

Examples: Applied Mathematics 

Calculus 

Complex Functions 

Systems of Differential Equations 

Probability for Finance a.o. 


The July 2010 issue of The Montana Mathematics Enthusiast can be downloaded for free: 

http : // www .math . umt . edu/TMME/vol7no2and3/ index . html 
http : / / www . math . umt . edu/TMME/vo!7nol / 


This is the URL of the Montana Council of Teachers of Mathematics 

http : / / www . montanamath . org/ 


You can download CINDERELLA 1.4 for free from 

http : // www . Cinderella . de/ tiki -index . php 




p 4 Derive- and CAS-TI-User Forum D-N-L#79 


J.L., Austria 
Lieber Josef, 

ich mochte Dir zwei Files zur Formel von Cardano schicken. 

Ich versuche gerade die Gleichung x 3 +3x 2 +9x+9=0 mit der F.v.C. zu losen. 

Das funktioniert mit TI-NSpire 2.1 problemlos, bei Derive kriege ich aber nicht das richtige 
Ergebnis (siehe #7). Irgendetwas lauft da nicht korrekt, entweder habe ich irgend einen trivi- 
alen Fehler gemacht Oder eine Voreinstellung ist ungunstig Oder ich kann mit Wurzeln nicht 
richtig umgehen Oder ... 

Dear Josef, 

I am sending two files wrt Cardano's formula. I am trying solving x 3 +3x 2 +9x+9=0 applying 
this formula. This is no problem with Tl-Nspire, but I am not able to obtain the correct result 
with DERIVE, maybe that I made a trivial mistake, a typo, a wrong setting . . . 
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The DERIVE file: 

3 2 

#1: SOLVE(x + 3 • x + 9-x + 9 = 0, x) 


#2: x 


2/3 1/3 

2 2 

1 L ■ 

f 2 / 3 

73-2 

1/3 1 

^3-2 

2/3 

2 

1/3 

2 

1 + L • 

f 2/3 

73-2 

2 2 

W 2 

2 

2 

2 

, 2 


73- 2 


1/3 


2/3 1/3 

v x = - 2 +2 -1 


#3: x = -0.8362599989 - 2 . 465853272 . i v x = -0.8362599989 + 2 . 465853272 . L v x = -1.327480002 

#4: [p := 3, q := 9, r := 9] 


#5: u := 


#6: 


#7: v := 


' 3 

2-p - 9-pq + 27-r 

f 3 2 

(2 *p - 9-p-q + 27- r) 

2 3 1 

(3-q - p ) 

1/2' 

W 34 

s 2916 

729 

> 


1/3 


2-p - 9*p -q + 27 *r 


54 


1/3 

u := 2 

3 2 2 3 ^1/2^ 1/3 

(2-p - 9-p-q + 27 • r) (3-q - p ) 

+ 


#8: 


2916 

2/3 2/3 

2 73-2 • L 

v := + 


729 


#9: 


p u + v p u-v 1/2 u + v p 

xl :: u + v , x2 := - - + (-3) , x3 :i - 

3 2 3 2 2 3 

u-v 1/2 

(-3) 


#10: [xl := 1.053621575 + 1. 374729636 • L , x2 := -0.8362599989 - 0 . 2836060010 • L , x3 := -3.217361576 

1.09112363 5. l] 

Compare #3 and #10! 


DNL: Dear Josef, try this: 

Set Branch : =Real , then it works! ! 

#11: Branch := Real 


#12: u := 


' 3 

2-p - 9-p-q + 27-r 

' 3 2 

(2-p - 9-p-q + 27-r) 

2 3) 

(3-q - p ) 

1/2' 

, 54 

, 2916 

729 

> 


1/3 


#13: u := 2 


#14: v := 


1/3 


2-p - 9-p -q + 27-r 


54 


3 2 2 3 

(2-p - 9-p-q + 27-r) (3-q - p ) 

+ 


2916 


729 


1 / 2 ' 


1/3 


#15: v := - 2 


2/3 


#16: 


p u+v p u-v 1/2 u + v p 

xl := u + v - , x2 := - - + (-3) , x3 := - - — 

3 2 3 2 2 3 

u-v 1/2 

(-3) 


#17: 


[xl :z -1.327480002, x2 :z -0.8362599989 + 2 . 465853272 • i ■ x3 :z - 0.83625 99 989 - 2 . 46585327 2 • il 
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When I prepared my talk for TIME 2010 about the use and didactical value of sliders in mathemtics 
education I came across a strange behaviour of TI-NspireCAS. 

I wanted to demonstrate the Taylor approximation supported by two sliders - one for the location of 
the Taylor expansion (= a) and another one for the order of the Taylor polynomial (= n). 



As you can see in the screen shot above this works for function e x • sin(2x). 



Then I tried e 3 • sin(2x) and I failed. I wrote to TI (Gosia Brothers) and she answered: 

We know that this is a problem with our series code dealing with floats. You need only put 
approx() around taylor() to see this problem in Calculator or Notes. David Stoutemyer devel- 
oped this code and he is currently working on updating it. Not sure when the fix will be in but 
I will let you know as soon as I know. 


Thnaks to Gosia for the immediate reply. The bug has not been resolved in TI-NspireCAS 2.0. 


D 


-N-L#79 


Josef Bohm: The Huffman-Code 


P 7 


Zipped today? - The Huffman-Code 

Josef Bohm, Wiirmla, Austria 

When you want to send big amounts of data via email, then you will probably compress the file(s). 
Some programs do the job. One of these programs gave the name: “zipping 44 . (Which one?) Com- 
pressed files show mostly the file extension zip or rar. Especially graphic files (photographs, scanned 
images, . . ., and dfw-files, of course) may become very large. It is easy to reach a couple of mega- 
bytes. Intelligent algorithms provide a compression to a significant smaller amount of data without 
loss of data. 


Name 

Typ 

Datum 

Grofte 

Komp... 

Kompri... | PI 

|D[Verkehr.dfw j 

Derive Worksheet 

13.10.2004 15:37 

7 220 411 

33 % 

31 331 


The picture shows the result of „zipping“ or „packing“ a Derive-file which contains some graphs. You 
can see the efficiency of the compression algorithm. 

A very simple method is the following: in a graph appears a sequence of 3878 white image points 
(pixels) followed by a sequence of 132 black pixels. Instead of listing w, w, ..., w, b, b, ..., b one can 
note much shorter: w, 3878, b, 132, loosing no information at all. The next paragraph is some text (in 
German) which shall be used for demonstrating compression for transmission. 

“bei der komprimierung von texten laesst man sich von der unterschiedlichen haeu- 
figkeit der zeichen in dem zu codierenden text leiten. wir wollen das an einem einfa- 
chen beispiel demonstrieren, wobei wir nur kleinbuchstaben, zwischenraeume und 
satzzeichen verwenden wollen. dieser text wird verwendet.“ 

Characters with a high frequency will be assigned to short code words in order to save bits! This basic 
idea is also realized in the Morse-Code. Character “e” is encoded by a code word of length 1, the 
character “q” which is pretty rare by a code word of length 4, 66 “. 

We create a „binary tree 64 . This is a directed graph consisting of vertices or nodes and edges (arrows) 
(= branches of the tree). Our first message 44 demo 44 to be encoded is 44 dieser text wird verwendet 44 . 
Using the ASCII-Code without check bit the length of the message is 189 bit. 

First of all we find out the frequency of the characters by simply counting. This can be done manually. 
For extended texts we will use the computer. 


DIM (list) 


#1: 

h(li'st, number) := Z IF("h'st = number) 

i'=l i 


#2: 

freq (plain) := 

50RT(SELECT(v t 0, v, VECTOR ([h(N£ME_T0_C0DES (plain) , k) , C0DES_T0_NAME(k) ] , k, 32, 127)))’ 
1 

#3: 

demo := dieser 

text wind venwendet, 


#4: 

DIM (demo) = 27 



freq(demo) = 

'111112233336' 



n s v x i w d r t e 


Expression #8 (next page) shows the frequency table for the paragraph from above starting with: 44 bei 
der komprimierung ... 44 . (Don’t forget to write the text under quotes. The quotes are not visible on 
the DERIVE screen, they are visible in the Edit-Line.) 
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#6: paragraph := bei der komprimierung von texten laesst man sich von der unterschiedl ichen 

haeufigkeit der zeichen in deni zu codierenden text lei ten. wir wollen das an einem 
einfachen bei spiel demonstrieren , wobei wir nur kleinbuchstaben , zwischenraeume und 
satzzeichen verwenden wollen. dieser text wird verwendet. 


#7: 

#8: 


freq(paragraph) 

2 2 2 2 3 3 3 4 5 5 7 8 8 9 9 9 9 9 11 14 14 17 25 29 40 50 

_,fgp.kxvbzmouachlw s d t r i n e 


The procedure is performed as follows: 

Generate a node for every character appearing in 
the message. Label all nodes with their weights 
(= frequencies). 

Until there is only one node remaining with no 
arrow directed to it, do: 

Connect two nodes with minimal weights 
which are not end points of an arrow by a 
new node. The weight of this “parent node” 
is the sum of the weights of the “children”. 

The arrows are directed from the parent 
node to the children nodes. 

The arrows are named as 0 (the left edge) 
and 1 (the right edge) by convention. 


On the left you can find the structogram for 
the algorithm. We will follow the instructions 
and create the “Huffman-Tree” for the code of 
our message. 

Then we will check the efficiency of the code, 
apply it and demonstrate how to decode the 
encoded message into a readable form again. 


This is another definition (found in http://en.wikipedia.org/wiki/Huffman_coding): 

The process essentially begins with the leaf nodes containing the probabilities of the symbol 
they represent, then a new node whose children are the 2 nodes with smallest probability is 
created, such that the new node's probability is equal to the sum of the children's probability. 
With the previous 2 nodes merged into one node (thus not considering them anymore), and 
with the new node being now considered, the procedure is repeated until only one node re- 
mains, the Huffman tree. 

Steps 1 to 3: 

We select the pairs of nodes with minimal weights one after the other. 





© 
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Steps 4 to 6: Pair (2,2) is connected by parent node 4. Then we find pair (2,3) and connect them giving 
node with value 5. “d” and “r” are following in step 6. 



Inspecting the graph we see that we need to connect the “t”-node with the value 3 node in the second 
row according to the algorithm rules. I rearrange the base line and insert the connection resulting in a 
parent node with value 6: 




The minimum nodes to be connected by edges are form pair (4,5) giving the sum 9. It is again neces- 
sary to rearrange the binary tree in order to obtain a clear structure - without crossing edges (step 8)). 
Step 9 results in node 12. 





Finally we connect 6 and 9 and the last step gives the “root” of the binary tree (or Huffman-Tree) 
with value (or weight) 27. 
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Having connected 9 and 6 to 15 the branch starting with node 12 is remaining. Both have the same 
root (parent) 27. 

Only one vertex (27) with no arrow directed is remaining. According to our instructions the job is 
done. It is an easy check to compare the weight of the final vertex (the root of the binary tree) with the 
sum of all (absolute) frequencies which is 27. 

I used Johann Wiesenbauer's tool for plotting directed graphs which appeared in DNL#78. These are 
the expressions which result iwhen plotted n the final Huffman-Tree: 

' -11 -9 -7 -5 1 3 5 7 9 11 -10 -6 -3 -1 2 8 -8 -2 3 -5 

huff5 := 

.0 0000000002 222224446 

10 -1 4.5 " 

4 8 10 . 

hS := [{}, {}, {}, {}, {}, {}, {}, {}, {}, {}, {1, 2}, {3, 4}, {}, {}, {5, 6} ( {8, 
9} , {11, 12}, {13, 14}, {15, 7}, {17, 18}, {16, 10}, {20, 19}, {21, 22}] 

graph(huff5' , h5, 0.4, true) 

We observe that frequently used characters are located close to the root, rarely ones can be found in 
the “tops of the tree 46 . 

The codes for the characters are yielded by following the graph from the “root” to the “leaf 4 and not- 
ing the labels of the edges along the path. The code for the “i“ is 0101 (follow the blue numbers), the 
code for the “x“ is 0100, the “r“-code is 101 and the “v“ is encoded by 00011. You see again that rare 
characters result in long paths which are equivalent to long code words and frequently appearing char- 
acters give short paths and consequently short code words. The Huffman-Code is the code which 
needs the minimal number of bits. (This can be proved.) 

Here is the complete code - given as a matrix - followed by the encoded message: 

(demo is the message “dieser text wird verwendet.”) 

edrtix wv s n 

#10: hcodel := 

. 11 100 101 Oil 0101 0100 0011 0010 00011 00010 00001 

00000 . 

#11: mess := huffcode(demo, hcodel) 

#12: mess := 

10001011100010111010011011110100011001100100101101100001100011111010010110 - 

00011001101100000 

#13: DIM(mess) = 91 


The characters e, d, ... and the code words 11, 100, ... are strings. So they are entered under quotes: 
“e”, “d”, ...,”11”, “100”, ... 
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The encoding procedure is done by a small program huffcode: 

huffcode(plain , code, p, cplain) := 

Prog 

cplain := "" 
code := code' 

p := VECTOR(plain|i , i, DIM(plain)) 

#9: Loop 

If P = [] 

RETURN cplain 

cplain := APPEND(cplain , (SELECT(vil = FIRST(p), v, code))|lf2) 
p := REST (p) 

The encoded message is decoded by following the path starting in the root bit for bit (go left for 0 or 
right for 1) until reaching a leaf. There you will find the respective character: 

"100|0101|11|00010| results in: dies ... 

Of course, the code must be transmitted together with the encoded message (if the code is generated 
from the message). Each language has typical frequencies for the occurrences of the letters. If the 
partners agree - and the message is sufficiently long - you can do without sending the code and rely 
on the typical frequency of the letters in the respective language. 

Let's try decoding using huffdecode: 

huffdecode(codtxt , code, plain, z, zz, branch, k) := 

Prog 
k := 0 

plain := 1111 
code := code 1 
Loop 

If codtxt = 1111 
RETURN plain 

#14: codtext := REST(codtxt) 

branch := FIRST(codtxt) 

Loop 

z := SELECT (v|2 = branch, v, code) 

If z * [] exit 
codtxt := REST (codtxt) 
branch := APPEND(branch , FIRST (codtxt)) 
codtxt := REST(codtxt) 
plain := APPEND(plain , zj,lj,l) 

#15: huffdecode(mess , hcodel) = dieser text wird verwendet. 

It seems to work! 

Possible questions and problems for students: 

1 It is possible to read off the length of the encoded message from the Huffman-Tree. Can you 
do this? 

2 Compare the number of the bits of the plain text “demo” and its compressed form. If we had 
only 12 characters we would need only 4 bits applying conventional coding. Would we benefit 
of Huffman-encoding? 

3 Decode the compressed mess manually. 

4 Why does the “adaptive method“, which determines the code from the plain text (as we did it 
here) bring a real advantage only if applied for longer messages? 
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6 Do a web research for frequency tables of letters in various languages. 

7 Encode the message “this sentence will be compressed”. Develop the Huffman-Tree in order 
to find the code, encode and decode. 


The programs for encoding and decoding are not so difficult. The real challenge for me was writing a 
program which returns the code - such that I need not “plant” the Huffman-Tree. 

hufftree(plain , s := 0, plainh, tab, el, e2, f 1 , f 2 , ne, tree) := 

Prog 

plainh := freq(plain) 

tab := VECT0R([v.],l , [vj, 2 , 11 11 ] ] , v, plainh 1 ) 
tree := [[]] 

Loop 

tree := APPENDftree, [tab]) 

If DIM(tab) = 1 
If s = 0 

RETURN t ab 1 
RETURN REST (tree) 

#16: el := (FIRST(tab))|2 

If el4.2 = 1111 
el := [el] 

fl := VECT0 R([v4.1, APPEND( m 0 m , v|2)], v, el) 
e2 := (FIRST (REST (tab))) 4. 2 
If e2^2 = 1111 
e2 := [e2] 

f2 := VECT0R([v|l, APPEND( M 1 M , v|2)], v, e2) 

ne := [(FIRST(tab))j,l + (FIRST(REST(tab)))|l , APPEND(fl , f2)] 

tab := REST (REST (tab)) 

tab := APPEND([ne], tab) 

tab := SORT (tab) 

edrtxiw. n s v 

#17: hufftree(demo) = 

_ 00 010 011 100 1010 1011 1100 11010 11011 11100 11101 1111 _ 

Comparing with hcodel you will notice that both codes are different. Yes, it is true, there is no 
unique optimal code. We should further see that the encoded message using this code will have the 
same length of 91 characters. 


#18: democode 


edrtxiw. n s v 

00 010 011 100 1010 1011 1100 11010 11011 11100 11101 1111 


#19: demo_mess := huffcode(demo , democode) 

#20: demo_mess := 


010101100111000001111111000010101001111110010110110101111111010001111000011011010 - 

0010011010 


#21: DIM(demo_mess) = 91 

#22: huffdecode(demo_mess , democode) 

#23: dieser text wi rd verwendet. 


We will apply huftree on the longer message paragraph from above. Then we will encode and 
decode this text. 
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#30: par_code := hufftree(paragraph) 


#31: 

par_code := 

‘ d 
. 0000 

t 

0001 

n x 

001 010000 

V 

010001 

0 

01001 

u 

01010 

i 

0101100 

f 

0101101 


9 

P 

r 

a c 

h 

1 

w 

b 

z 


0101110 0101111 

0110 

OHIO 01111 

. 10000 

10001 

10010 

100110 

100111 


s k m i g 


11000 1100100 1100101 110011 1101 111 J 

#32: par_mess := huffcode(paragraph , par_code) 

#33: par_mess := 

100110111110110100001110110101110010101001110011010111101101101110011110111101100 - 

101000101011101010100010100100110100011110100000001111001101100010111011111000110 - 

000001101110011011100011011100011010111110000101010001010010011010000111011010101 - 

010001000111101101100001111100001101111000010001110101111100001110011011000001110 - 

111010100101101110101011101100101111110100011010000111011010110011111111010111110 - 

000111001101110100110100001111100111011001110101010101111010010000110111101101110 - 

010000111001101000111101000000011011000111111010001111001110010010110010110101101 - 

011001001001100011000111100110100000111011000101011100011011111101001111110011101 - 

111110100101011010111001111100001110011011001101111101110000101111110111110001101 - 

000011111001101001001110000001011011011110110111001010110010110010010011001101111 - 

101101100101101011010100101010011010111001011000111111010011001100101001111100001 - 

100000010111010011011100101011001011001111001011011100001111100001110010110011101 - 

110101011001111110101010001000010111000011100001100111100111111110101111100001110 - 

011010100011110110100101110010000111001101100100100110001100011110011100100101000 - 

011011111100011101101010001111010000000110110010110101100000101010001111011010010- 

111001000011100011100100 


#34: DIM(par_mess) = 1239 

#35 : huffdecode(par_mess, par_code) 

#36: bei der komprimierung von texten laesst man sich von der unterschiedl ichen 

haeufigkeit der zeichen in dem zu codierenden text leiten. wi r wollen das an 


einem einfachen beispiel demonstrieren , wobei wi r nur klei nbuchstaben , 


zwischenraeume und satzzeichen verwenden wollen. dieser text wi rd verwendet. 


My program hufftree2 has a tighter code and returns another - but also optimal Huffman-Code. 


#39: 


hufftree2(demo) = 


d 

000 


ret x 
001 01 100 1010 


10110 


n i 

10111 1100 


s 

11010 


v w 

11011 1110 


1111 J 

#40: huffcode(demo , hufftree2(demo)) 

#41: 00011000111010010011111100011010100111111101100001000111111011010011110011011100001- 

10010110 


#42: 


DIM (huff code (demo, hufftree2(demo))) = m 


p 
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hufftree2(plain , s := 0, plainh, tab, el, e2, fl, f 2 , ne, tree) := 
Prog 

plainh := freq(plain) 

tab := VECTOR ([v|l, [ [v j, 2 , 11 11 ] ] ] , v, plainh 1 ) 
tree := [[]] 

Loop 

tree := APPEND(tree, [tab]) 

If DIM (tab) = 1 
If s = 0 

#57: RETURN tab|l|2’ 

RETURN REST (tree) 
el := (FIRST (tab))|2 

fl := VECTOR ([v|l, APPEND( n 0 n , v*2)], v, el) 

e2 := (FIRST (REST (tab)))|2 

f2 := VECTOR ([v|l, APPENDC’l” , v*2)], v, e2) 

ne := [ (FIRST(tab))|l + (FIRST(REST(tab)))il, APPEND(fl, f2) ] 

tab := REST (REST (tab)) 

tab := APPEND([ne], tab) 

tab := SORT(tab) 


Default for s (second parameter in the parameter list) in hufftree and hufftree2 as well is 0. If you 
enter any other value then you will obtain another output: You can follow the Huffman-Tree growing 
(from its leaves and branches down to its root). I will demonstrate this using again demo: 


hufftree(demo, 1) 


i [..ii [i 
1 [n, ] 1 

1 [s, ] 1 

1 [v, ] 2 

1 [x, ] 2 

2 [i. ] 

, 2 

2 [w, ] 

3 [ , ] 3 

3 [d, ] 3 

3 [r, ] 3 

3 [t, ] 3 

6 [e, ] J 6 

3 [t, ] 


#25: 


x 0 
i 1 
w 0 
. 10 
n 11 
S 00 
v 01 
1 

[e, ] 
d 0 
r 1 


[s, ] 
[v, ] 

[X, ] 

[i, ] 
[w, ] 
0 

n 1 
[ , ] 
[d, ] 
[r, ] 
[t, ] 

[e, ] 


[e, ] 
d 0 
r 1 
t 0 
x 10 

l 11 


[x, ] 
[i, ] 
[w, ] 
. 0 
n 1 
s 0 
v 1 
[ , ] 
[d, ] 
[r, ] 
[t, ] 
[e, ] 


w 

0 



' 6 


10 



6 

11 

11 



- 

s 

00 




V 

01 



6 


1 





6 

[e, ] 
d 0 
r 1 
t 0 
x 10 

i 11 

w 00 
. 010 
n Oil 
s 100 
v 101 
11 


[w, ] 
0 

n 1 
s 0 
v 1 
[ , ] 
[d, ] 
[r, ] 
[t, ] 
x 0 
i 1 
[e, ] 


s 0 
v 1 
[ , ] 
[d, ] 
[r, ] 
[t, ] 
x 0 
i 1 
w 0 
. 10 
n 11 
[e, ] 
0 


12 


x 10 

l 11 

w 00 
010 
n Oil 
s 100 
v 101 
11 
e 0 
d 10 
r 11 


12 


15 


[d, ] 
[r, ] 
[t, ] 
x 0 
i 1 
w 0 
. 10 
n 11 
s 00 
v 01 
1 

[e, ] 
e 0 
d 10 
r 11 
t 00 
x 010 
i Oil 
w 100 
1010 
n 1011 
s 1100 
v 1101 
111 
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e 00 
d 010 
r Oil 
t 100 
x 1010 

i 1011 

27, 

w 1100 
. 11010 
n 11011 
s 11100 
v 11101 
1111 


I am sure that you can “read” the output. You can see step by step how the vertices are collected and 
how the weights are added. The last element gives the weight of the root and the complete code. 

Solution for task 7 (page 12): The encoded message should contain 120 characters. 

Huffman-Code on the TIs: 



I did not program (until now) the hufftree routine 
to generate a code. Maybe that one of the readers 
will transfer the DERIVE program from page 12 
to the Voyage 200 or to TI-NspireCAS? 

Let me know. 


ffl 1 gebra 

baTcl 

r fht ■) 

Other 

F'r gn 1 0 

[clean Up] 

i rr^-n 


^ rryn 


L 11 1 1 11 11 100" 11 1G1 11 11 0 1 1 11 "0101 11 "OK 

■ hufcode(deno ? codel T ) + cdeno 

11 1000101 11000101 1101001 101 11 1010001 10 (» 

■ hufdcode(cdeno , codel ^ ) 

"dieser text wird oerwendet. I 11 

■ din(cdeno^ 91 

dim Cede mo > 


(The Tl-functions are among the files which can be 
downloaded.) 


FUhK 
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The Josephus Problem 

Roland Schroder, Celle, Germany 

In 70 AC 40 rebellious Jews were captured in Rome which should be sold as slaves 
in punishment for their behaviour. In order to avoid their doom they agreed on a pro- 
cedure for mutual extinction: They formed a circle and every seventh in the row 
should be killed (continuing counting in the same direction). The remaining last one 
should commit suicide. The later historian Flavius Josephus chose a position that he 
remained as the last one - and he didn’t commit suicide. 

The story is wholly invented. In Scandinavia the following legend is passed on: In 
times when St Petrus strolled on earth he repaired on a ship with 15 Swedes and 15 
Norwegian. The ship got in a thunderstorm followed by distress at sea. Salvage 
seems only possible if half of the passengers will go overboard. St: Petrus provides 
the following counting method: The passengers form a circle. St. Petrus starts at a 
certain position counting until 9. Person number 9 has to leave the ship Gump from 
board) and the circle will be closed immediately. St. Petrus keeps the direction and 
counts again up to 9 - and the next person jumps (or will be thrown). The procedure 
goes on until 15 people are remaining. How had Petrus organized the starting posi- 
tions of the 30 passengers that the Swedes - which were preferred by him - had 
been saved? (Sorry for our Norwegian DUG-Members!) 

But this story is also imaginary. There are also versions with Christians and Turks, 
(and people from Vienna and Klagenfurt in DNL#52, Rudeger Baumann). The ma- 
thematical problem behind is: n elements are arranged in a circle and numbered from 
1 to n. Then every k th element is removed. Which element (Josephus) or which ele- 
ments (St. Petrus) will remain after e countings? 

Putting oneself in an affected person’s position, one would like to have an easy and 
quick algorithm available to find an advantageous position for oneself. In both tales 
narrated above the numbers are small, so the simulation of counting by paper and 
pencil will deliver the requested solution very soon. Taking numbers above n = 100 
makes manual simulation so laborious that it makes sense to look for a more elegant 
solution of the problem. The first mathematical treatment of the problem originates - 
according to the author's knowledge - from Leonhard Euler, who found a recursion 
formula, which will not be used now. 

Instead of this we will develop some DERIVE-functions. The function “jo(x,yk)” shall 
relieve us of the manual paper and pencil work and deliver immediately the surviving 
person. The meaning of the functions are self explanatory calculating some 
appropriate examples. We will do this now. 


[1] DNLs #52, 57 (Josephus Problem) & #53, 55 (Josephus Permutations) 

[2] http://web.me.com/ntheriau/iosephus.pdf (Generalization of the Josephus Problem, Tait's Algorithm)) 

[3] Donald Knuth a.o., Concrete Mathematics 
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#1 : rOO := VECTORCy, y, x) 

#2: r(12) = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12] 

g(v , k) VECTOR (v , n, IF(k = DIM(v), DIM(v) - 1, M0D(k , DIM(v)))) 

#3: n 

#4: g(r(12) , 7) = [1, 2, 3, 4, 5, 6, 7] 

f(v, k) := VECTOR (v , n, IF(k = DIM(v), DIM(v) + 1, M0D(k , DIM(v)) + 1), DIM(v)) 
#5: n 

#6: f(r(12) , 7) = [8, 9, 10, 11, 12] 

#7: h(v, k) := APPEND(f(v, k) , g(v, k)) 

#8: h(r(12) , 7) = [8, 9, 10, 11, 12, 1, 2, 3, 4, 5, 6, 7] 


h(v, k) puts together the two parts g(v, k) and f(v, k) of the vector v = r(x) in inverse 
order. Then the last element of h(v,k) is removed and the same procedure is applied 
on this newly generated vector. 

Taking to pieces - Inverse joining - Removing the last element. 


#9: i(v, k) := DELETE(h(v, k) , DIM(v)) 

#10: 10(12), 7) = [8, 9, 10, 11, 12, 1, 2, 3, 4, 5, 6] 

jo(x, k) := (ITERATE ( i (w, k) , w, r(x), x - 1)) 

# 11 : 1 


#12: jo (40, 7) = 24 

#13: jo (41, 3) = 31 

#14: jo (100 , 7) = 50 

Josephus stood on position 24 - and he survived. 

We produce a value table and the respective graph for the relation between the 
number of persons forming the circle and the position number of the surviving person 
with a fixed k (here k = 7, k = 5 and k = 3): 

#15: Joseph(y, k) := VECT0R([x, jo(x, k)], x, 2 ■ k , y) 

#16: Joseph(100, 7) 

#17: Joseph(100, 5) 

#18: Joseph(100 , 3) 

The tables are leading to scatter diagrams which look as follows: 
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The points are lying on lines with slope k. The accurate knowledge of the equations 
of the lines - their y-intercepts in addition to k - can help solving the Josephus prob- 
lem in another way. When among n participants every k th will drop out then the 
y-intercepts can be generated recursively (the proof is left for the reader). 



r 

' k-x ' 


#16: L(n , k) := ITERATES 

FLOOR 

k 

. k - 1 , 

+ 1, x, 1 , n 

y 


#17: L(40 , 7) = [1, 2, 3, 4, 5, 6, 8, 10, 12, 15, 18, 22, 26, 31, 37, 44, 52, 61, 
72, 85, 100, 117, 137, 160, 187, 219, 256, 299, 349, 408, 477, 557, 650, 
759, 886, 1034, 1207, 1409, 1644, 1919, 2239] 


There is one line with equation y = 7x + b which contains point [40; jo(40,7)]. As the 
y-value must be positive and less or equal x only one line is possible: y = 7x - 256. It 
contains point (40; 24). So follows: Josephus survived standing on position 24. 



The sequence L(40,7) can easily be found by paper and pencil: For calculating the 
successor of a n one has to find the next number b n > a n which is divisible by 6 - even 
then when a n is divisible by 6. Then a n +i = a n +b n /6. The procedure is terminated 
when a n +i >7-40. Then a n is the requested y-intercept and 280 - a n is the position 
to surive. 


(I tried to automate the procedure using SELECT, Josef): 

pos(n, k) := (SELECTfp > 0 a p < n, p, SUBST(VECTOR(k -x - b, b, L(n, k)), x , n))) 
# 21 : 1 

#22: [pos(40 , 7), pos(lGQ, 7), pos(10Q, 2)] = [24, 50, 73] 


I found a nice recursive algorithm to produce the jo’s [21 : 

tait(n, k) := 

If n = 1 
#24: 1 

M0D(tait(n -1, k)+k-l, n) + 1 
#25: [tait(40 , 7), tait(100, 7), taitClOO, 2)] = [24, 50, 73] 


See the Tl-treatment of JOSEPHUS on the next page, Josef. 


D-N-L#79 


Roland Schroder: The Josephus Problem 


The functions are named as in JOSEPHUS from 
ve. 


FIHRHCE* 


FlashRpp 


FHt T FE T FfiT T 

Other Prgm 10 Clean Up 


1 seqCy , y , 1 , x) + r Cx) 

'seafurnl n 1 /din(u) - 1 , k = din(y) 
se^wLnj , n , l , ^ nod( - k > din( y)), else 

1 seqfyTnl n /din(u) + 1, k = din(y) 
seq l y LnJ - n ’ \nodC k , din( + 1 , else 

augments ft o , k} , gfo , kW ■* hto , k} 


tait 

Mfllhk 


1 seq(y , y, 1 , x) 


Clean Up! I 


Done 


' seq^o [n] 7 n, 1 ? 

1 seqfo[n] ? n ? f ^ 


/" d i m( o) - 1 ? k = d i mt y) "1 


7 l_mod(k ? dim(y)) ? else 
fdin(o) + 1 ? k = dim(y) 


^[ VLI ,J 7 1 1 7 lnod( k , dinC v)) + 1 , else 
augment(f(y ? k) ? g(y ? k)) ■* h(y ? k) 


J + 90 

Done 
, dir'ii> 
Done 
Done 


FHt Y FE T FfiT T 

0ther|PrgmI0|Clean Up 


l_mod(tait(n -l ? k}+k-l ? n)+l 

■jot 40 ,7) 

■ jo<41, 3) 

■ JO( 100 , 7 ) 

■ tait(20, 7) 

■jot 20, 7) 


taxt < 40 , 7 > 


Calculating jo takes some time: 


FbT T 

Clean Up 


taxt < 40 , 7 > 


I am very soon Out of Memory! 


uxl=l 


I define the recursive function m Sequence Mode 


124. 


■ jot 40, 7) 

24 

■ 00(41,3) 

31 

■ jot 100, 7) 

50 

■ taitt20, 7) 

3 

■ jot20,7) 

3 

■ taitt 40 , 75 

Error: Memory 


131. 


Check the „last positions 6 * 


c 2 =.„ ait <cl[k] , 7 >,k,l,diri<cl>> 


Finally the “Joseph“-Scatter diagrams: 
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Surface #11: x 2 + y 3 + z 5 - 0 



•• •• 


plot (plot :: Implicit3d (x A 2 + y A 3 

x = -1.5. 
y = -1.5. 
z = -1.5. 


Scaling — Constrained) 


+ z A 5 , 
1.5, 
1.5, 
1.5), 



Surface #12: (x 2 + _y 2 + z 2 - 1) = x 2 z 3 + y 2 z 3 




DPGraph 


DERIVE 
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Nonlinear Regression, Logistic Regression for Binary Dependent Data, 
And Two-Stage Least Squares Regression for the TI-89 

MacDonald R. Phillips, don.phillips@qmail.com 

The routines in this folder solve nonlinear regression problems using the Gauss_Newton 
Method with Step-Halving, logistic regression problems for binary dependent data using the 
probit, normit, or complementary log-log link functions, and two-stage least squares regres- 
sion. 

NOTE: These programs are offered “as is.” I make no claim that they are entirely bug free, 
although I believe they are. If you encounter any problems with the programs, please send 
me an email so I can correct them. 

NOTE: These programs require the use of the Statistics with List Editor Flash application. 

My aim is to teach you how to use these programs, not to teach statistics. Thus, when I 
mention the ANOVA table or logit link function, I assume you already know what they are 
and/or when they are used, or are learning about them either in a class or on you own. 

Fitting data to an arbitrary function is more of an art than a science. Convergence to a solu- 
tion can be very sensitive to the initial starting values, i.e., guesses. And, there may be more 
than one solution or local minimum around the starting values. If you get error messages 
such as singular matrix, this may mean that there is no solution or you need to choose a dif- 
ferent set of starting values. 

The routines are in a group file, AdvReg.89g. 

Use Tl-Connect to transfer them to your cal- 
culator. There is a menu program; this 
needs to be run in order to use the regres- 
sion routines. The custom menu sets up 
four pull-down menus: Tools, Nonlinear, Lo- 
gistic, and TSLS. 

Don't worry about the changed menu bar. [2ND] 
[3] restores the default menu. 

All regression routines use a data matrix created with the Data/Matrix Editor. The data ma- 
trix consists of the variables in any order. The first row of the matrix must be the variable 
names; I recommend one-letter names. In any case, just make sure they do not conflict with 
any of the variable names in the AdvReg folder or Tl reserved names. (There are no vari- 
ables with one-letter names in the folder.) When performing regression there is no need to 
use all of the variables in a dataset; this means you can do many different regressions on a 
dataset without having to enter a new data matrix each time. It also means you can do 
“model building” and test the significance of adding variables to a regression equation; this 
ability is one of the routines. 

Give the data matrix a name and save it. Open the Tools menu (FI) and select AddDS(). 
You will be prompted to enter the name of the data matrix. When you run NonLin(), Logit(), 
or TSLS() you will be asked to select a dataset from the list of datasets created with 
AddDSQ. AddDSQ also archives the dataset. 


Y FjT Y FHt Y | 

iWHHon 1 i near iLog i si i c ITSLS 1 

1 : ClrHone 

■fllfBISIffilSH 


3 : AelDSO 
4: GeiUars£ ) 



menuO 

liDYREG RAD liUTD FUNC 0/30 
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Tools Menu 

The options under the F1:Tools menu are straightforward. Option 1 clears the home history 
screen. 

Option 2, AddDSQ, prompts you to add the name of a dataset to the list of datasets; this list 
is how you tell the programs what dataset to use. It also archives the dataset. 

Option 3, DelDSQ, deletes a dataset from memory and the dataset list when you select its 
name from dataset list. 

Option 4, GetVars(), displays the variables in a dataset, in case you forgot what they were, 
as well as the number of observations in the dataset. Choose the dataset from the list pre- 
sented. 


Nonlinear Regression 

The nonlinear regression routine can also handle weighted regressions as well as regres- 
sions with complex data. However, I’m not sure one can do weighted regressions with com- 
plex data; at least I’ve never seen an example against which I can test the program. The 
nonlinear regression program will, of course, do linear regressions. 

Example 1 

Most regressions are linear regressions or can be transformed into linear regressions. 

Some, however, cannot be transformed. For instance, an exponential equation of the form 

Pi =K X x (v, - 1790)) + b 2 

may model the growth of the U.S. population by decade from 1790, but it cannot be trans- 
formed into a linear regression problem. (If the b 2 variable was not there, it could be trans- 
formed into a linear regression problem.) A nonlinear regression program is needed. Press 
F2, Nonlinear, and select the first menu item, NonLin(), and then press enter, once or twice 
as needed. This sets up a data input form. 

The first item is “Select dataset.” Press the right arrow key to see the list of datasets. Scroll 
to “pop,” if needed, and press ENTER, “pop” has the U.S. population figures, in millions, 
from 1790 to 1990, by decade. Now, press the down arrow key to enter the regression equa- 
tion. The next line is used to enter the regression equation; the equation there is the one 
displayed above. So enter p=b0*e A (b1*(y-1790))+b3. Press the down arrow key to enter a 
list of the parameters and their initial guesses. Enter {b0=20, b1=.03, b2=10}. Finally, enter 
the weight variable; if none, enter the number 1 . The screen should look like this: 
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1! I I II 

After the data is input, press enter to begin computing the regression. The program keeps 
you informed as to what is going on. It first sets up the necessary matrices, etc., needed to 
compute the regression. After that, each iteration is displayed along with the current sum-of- 
squared-errors. At the end, a message will be displayed indicating whether or not the routine 
converged to an answer. (As seen below, the convergence criteria can be changed.) 


[ Bf ^ I ffeil blfegffei' f feaseagiBEni iia Bar ] j 


r F1 t Y FEt Y FI:t Y FHt V 7 

|Tools|Nonl inear|Logistic|TSLS| ] 

Convergence criteria net! 
Press ENTER to quit! 

■nonlinO Done 

■ f eqn 

P = 2. 507464E "7 ■( 1 . 010549) y - 39. 29G524 

FeqN 


liDVFiEG RFlD FlUTD FUNC 4/34 ISIllHa 

| liDVFiEG RAD AUTD FUNC E/34 1 


The other options under F2 (Nonlinear) display the output of the regression. 

Option 2, FeqN, displays the fitted equation. In this case it is 

p = 2.50746^ 7 (1.01055) r -39.29052 

(It is unfortunate that the calculator simplifies the answer instead of leaving it in the form of 
an exponential equation.) 

Option 3 under the F2 menu, OutN, displays a matrix of the parameters, their values, stan- 
dard errors, t values and probability(t). For this regression the output is 


( FiT T FT? Y Y FHT t - 

|Tools|Honl inear |Logislic|T5L5| 


- t i 

P = 2. 

5Q7464e "7 ■( 1 

GlG549) y - 

39. 29G524 

"Parm" 

"Value" 

"STD" 

'7(18)" 

"Pr ob(7)" 

■ outn 




60 

36.054 

4.11342 

8.76498 

6.51895£“ 8 


■"Parn" 

"Ualue" 

"STD" 

"t(18)" 





5.1456£~ 14 


bG 

36. G54QG2 

4.113416 

8.76497E 

bl 

.010494 

.00051 

20.73862 


bl 

. G1G494 

. GQG5G6 

20.73361* 

bl 

-39.29052 

5.88023 

-6.6818 

2.87623£ ,HS 


,b2 

-39.29G524 

5.88023G 

-6.6818G 







llPutNl 


I ADVREG FiliD AUTD FUNC 5/50 


(The 18 in “t(1 8)” is the degrees of freedom of the t statistics.) 


Option 4 under the F2 menu, Iter, displays a matrix of the iterations the program went 
through to reach the estimated values of the parameters. The iteration number, or sub itera- 
tion number, parameter values, and sum-of-square errors are displayed for each iteration. 


( fit t FT? Y Fi^ Y fht t 
|Tools|Honl inear |Logislic|T5L5| 


bl 

GG . G3GGGG 
7 . 029135 

^4 ,024311 

2 . 0165QO 

89 . QG899G 


b2 

10.000000 

7.082649 

5.526983 

-4.419G77 

-25.856442 

-IF 1 T77tn 


M SSE M 

133373G23. G55 
2543253. G1592 
231374. 97G3G9 
884G. 389G57 
75438. 156331 
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Option 5 under the F2 menu, ANOVA, displays the analysis of variance matrix. 


"Source" 

"DF" 

"SS" 

"MS" 

""F" "Prob(F)" 

"Reg" 

2. 

122823. 

61411.4 

3331.81 7.47334£~ 24 

"Error" 

18. 

331.773 

18.4318 

ft It If ft 

"CTotal" 

20. 

123154. 

ft If 

II II If ft 


r FIT Y F£t Y F]:t Y FHt Y 'i 

|Tools|Nonl inear|l_ogistic|TSLS| I 

■ 

anoua 

'"Source" "DF" "SS" T 
"Reg" 2.000000 122S22. 710204 61 
"Error" IS. 000000 331.772790 IE* 
."CTotal" 20.000000 123154. 4920S4 "" 

mm 

1 1 

1 ADVREG RAD AUTU FUhK 1/30 1 



The “dotal” in the ANOVA matrix stands for corrected total degrees of freedom and sum of 
squared errors. The corrected totals are used when there is an intercept in the regression 
equation. If there is no intercept, then the uncorrected totals are used. However, R A 2 and 
adjR A 2 are always computed with the corrected totals. (See a “Cautionary Note About R A 2” 
by Tarald O. Kvalseth in The American Statistician, November 1985, pp. 279-85.) 


Option 6 under the F2 menu displays the R square, adjusted R square, and standard error of 
the regression statistics. For this problem they are: 0.99731 , 0.99701 , and 4.29323. 


( FiT T FT? T Y FHT t 

|Tools|Honl inear |Logislic|T5L5| 


■ ClrHone Done 



'"Rsq" 

.997306 

■ rsqn 

" flRsq 11 

. 997007 


."SE" 

4.293230. 


[bsqni 


ADVREG RAD liUTD FUhK £/30 


Option 7 under F2, PrdNonl({ }, 1, .95), computes the predicted values for the mean and indi- 
vidual values of the dependent variable. The default weight is 1 (indicating no weight) and 
the default confidence interval is .95 for a 95 percent confidence interval. Enter a list of the 
independent variables and their values. In this case find the estimated population for the 
year 2000. Enter y=2000 in the list. 


r FIT Y F £t Y FjT Y FHt y~ 

|Tools|Nonl inear |l_ogistic|TSLS| 

1 

■ ClrHone 


Done 

■ prdnonlCCy = 2000> ? 1 , . 95) 



r "PUalue" 

P = 287.300537 

II II -1 


"SeQ^SeY" 

4. 193426 

6.001387 


"LowerCI " 

278.490476 

274.692091 


. "UpperCI " 

296.110599 

299. 908983 J 

IPrdNonKCy 1 

=2000> _1 _ _95>l 1 

IADVREG RAD AUTD FUhK £/30 1 


"PValue" 

p = 287.30054 

ft ft 

’Sey/SeY" 

4.19343 

6.00139 

’LowerCI" 

278.49048 

274.69209 

’UpperCI" 

296.11060 

299.90898 


The first line of the matrix gives the predicted value of the equation for the year 2000, 287.3 
million people. The second line gives the standard errors of the mean and individual values 
of the dependent variable, in this case p. The third and fourth lines give the 95 percent con- 
fidence interval for the mean and individual values of p for the year 2000. 
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r FIT Y Y FjT Y FHt Y | 

|Too 1 s |Non 1 i near |Log i si i c |TSLS | 

■ ClrHone 


Done 

■ prdnonl( £y = 

2G1G} , 1 , .95) 



' "PUalue" 

P = 323.435022 

II II -| 


"SeQ^SeV" 

5.72G2G9 

7. 152106 


"LowerCI 11 

311.417303 

3G3. 4G9GG5 


. "UpperCI 11 

335.452735 

333. 46103sJ 

PrdNonKCy 

=2010> _1 _ _95> 

ADVREG RAD AUTU FUNC E/30 


r F1 t Y FEt Y Fjt Y FHt Y | 

|Too 1 s |Non 1 i near |Log i si i c |TSLS | 

■ ClrHone 


Done 

■ prdnonl( £y = 

2G2G} , 1 , .95) 



' "PUalue" 

P = 363.567475 

II II "1 


"SeQ^SeV" 

7.65G539 

3.772331 


"LowerCI" 

347.494239 

345. 136441 


."UpperCI" 

379.64G66G 

381.998509] 

PrdNonKCy 

=2020> _1 _ _95>l 

liDVREG RAD AUTU FUNC E/30 


Finally I'd like to plot the pop-data together with the regression line (Josef) 




1 POp T 1 

[y 1790 1SGG 1S1G 1S2G 1S3G 184G ► 

*4 1950 I960 197G 198G 1990} ■* popx 

£1790 18GG 181G 182Q 183G 184G ► 


[p 3.929000 5.303000 7.239000 9.633> 

i tt n-~in g Tnn t -~iTn n Tn i -~i rirr d 


POPX 



aduKt3\POP3Ka Plot i 


Plot Type . . . 
Mark 


Scatters ■ 

Box* 

x 


adoregxpopx 


y 


adoregxpopyl 




1 


Use Freq and 

Categories? NO* 

Krs-'.: w ^ ^ 

.. , , v .. , 




.. .... .. .. , 



0 \i'\ ir 




^CEnter=5FlUE!) 


t!ESC=CnHCEL^; 

ADVREG RAD AUTD 

FUNC 



ifl^Y FF? Y f:=t y fht y ft Y Fe^ Y i 
| t f — |fl 1 gebra |Ca 1 c |other |Pr gn 1 0 |C 1 ean Up | [ 


[y 179Q 18GG 1810 1820 183G 184G ► 

•4 195G I960 197G 198G 1990} ■* popx 

£1790 18GG 1810 182G 1830 184G ► 

■ pop T 2 

[p 3.9290GQ 5.3G8000 7.239000 9.633> 
■ 4 . 323 203.211 226.542 248. 71 } ■* popy 

£3.929000 5.303000 7.239000 9.633Q 


£ 


iTmnY fet Y”f5 YpTYfF^Y FfiT ytt y~~ 
' |Zoon |Ed i i | - ' |R 1 1 |sty 1 e I* .. \ 


JLOTl- 1 

Plot 4: 

Plot 3: 

Plot 2: 

-■Riot 1 : Lilil □ y:adurt3\P0py 

■^■y 1=2. 50746 ■ 1G " 7 ■( 1 . 01055) x - 39.29052 
y2=l 

y 5 = 

y4= 

y5= 





Option 8 under the F2 menu, MB(), is for “model building.” If you added one or more vari- 
ables to the previous regression, MB() will compute the F statistic and probability associated 
with adding the variable(s). 

Option 9 under the F2 menu allows you to change the convergence criteria by setting the 
maximum number of iterations and subiterations and the criteria for the percentage change 
in successive sum-of-squares values. The values I have set are 30, 10, and 10" 8 . 

(NOTE: NonLin() may be used for linear regression 
also, that is, where the equation is linear in its pa- 
rameters. There are no restrictions on the inde- 
pendent variables. They may be any differentiable 
function. For instance, if x is an independent vari- 
able, it may occur in the equation as x 2 or x 5 , etc., or 
SIN(x), LN(x), EXP(x), etc. When using NonLin() for 
linear regression, you may set the initial guesses of 
the parameters equal to 1.) 


( FiT T FF? Y FF^ Y FHt t 
|Tools|Honl inear |Logislic|T5L5| 


ConUQKSftnCQ CKitftKiO 


Max Iterations: |3G 

Max Subiteriations: 

110 

Converge Criteria: 

|e-8 

, CEnter=0K } 

CESC=CflHCEL^ ; 


1 CIrHone 


Done 


CriteriaO 


ADVREG RAD AUTD 

FUNC 1/30 


You can find more worked problems at the end of this article, Josef 
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Example 2: Weighted Regression 

The weight variable may be specifically included in the dataset, or it may simply be one of 
the independent variables already in the dataset. The weights are usually put in an n by n 
matrix along the diagonals. However, for large datasets such a matrix can be too big to be 
handled by the Tl-89’s limited memory. So I developed a routine to that uses the weights in 
a vector (list) instead of a matrix. For anyone interested the function is ‘dmmul’. 


The dataset “dat” is used for the weighted regression. You may use the GetVars() program 
under FI to see the variable names and the number of observations in the dataset. 



'spc' is the sales of electricity per customer 
in a state; *y' ' s the per capita income in that 
state; 'ep' is the price per kilowatt hour; and 
'gp' is the price of an amount of natural gas 
that that the energy equivalent of a kilowatt 
hour of electricity. The data is to be 
weighted by the inverse of the income per 
capita, y. 


The equation we estimate is spc = a + b1*ep + b2*y + b3*gp with a weight of 1/y. Since this 
is a linear regression, enter the initial guesses as 1. 

The fitted equation is: spc = 0.45728*y - 7063.7816*ep + 1245.65916*gp + 40060.28733. 



The parameters and their standard errors and the ANOVA table are: 
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The predicted value for y = 7000, ep = 4, and gp = 1 .5 is spc = 1 6874.6. Notice that the 
weight is entered as a number, i.e. 1/7000 for 1/y. The standard errors of the mean and pre- 
dicted values are computed as well as the confidence interval for each. 


( FiT T FT? T Y FHT t 

|Tools|Honl ineaHLogistic|T5L5| 


T- 


CTot-al 11 49.0000 391886.2360 



■ prdnonl[^ -:I y = 7QQQ ep = 4 gp=1.5> ? -^fr 

' "PUalue" spc = 16874.6119 1111 

"SeyxSeY" 1337. 5770 5195.7693 

"LowerCI 11 14182. 2Q9Q 6416. Q7Q8 

. "UpperCI 11 19567. Q147 27335. 1529. 

... 000, ep=4,gp=1.5>, 1/7000, ,95>l 

ADYREG RAD AUTD FUNC £9/30 


Example 3: Regression with Complex Data 

The complex dataset is “datl ” with variables xl , x2, and y. It has only 4 observations. 


r FIT Y F£t Y F]:t Y FHt y 

IToolslHonl inear |Logisiic|TSLS| 

1 







*' " ,== ^ === =M 



Choose Dataset datl-* 



, CEnter=0K 

CESC=CflHCEL _ > 


■ Clr-Hone 


Done 

GetUarsO 1 




The input for the regression is: y = a + bl *x1 + b2*x2 with a weight of 1 , meaning no weight. 
The initial guesses are all 1. 


r F1 t Y F£t Y Fjt Y FHt Y 'i 

|Tools|Honl i near |Logi stic |TSLS| ! 

■ C 

f Nonlintor Input 


Select dataset datl-* 

Dne 

Equation: |y=a+bl*xl+b2*x2 

Parameters: la=l , b 1 = 1 , b2=l> 

Weight variable: H 

, CEnten=0K 5 CESC=CflHCEL^ 


■ getuarsQ Done 

NonLinO 

TYPE + EENTER]=DK AND EESC]=CAHCEL 

The fitted equation is: 


( fit t FT? Y Fi^ Y fht t 
IT oolslHonl inear |LQgislic|T5L5| 


■ ClrHone Done 

■ get-oarsO Done 

■nonlinO Done 

■ f eqn 

g = ■ 98969 ■ xl + 4. 00G09 ■ x2 - ■ 85900 + (2. Q 


|f egn 


ADYREG RAD AUTD FUNC H/30 


y = 0.98969*xl + 4.00009*x2 - 0.83900 + (2.02107*xl + 2.9383 l*x2 - .59506)*/ 

where /' is the square root of -1 . 


Logistic Regression 

This program computes the logistic regression for binary dependent data using the logit, pro- 
bit (normit), or complementary log-log link functions. Binary dependent data is in the form of 
Os and Is, where 1 signifies the occurrence of an event and 0 its nonoccurrence. The out- 
put is a regression equation that can be used to predict the probability of an event happening 
given a set of values for the independent variable(s). 

The dataset may have a frequency variable or two variables denoting the results of a bino- 
mial experiment. The two variables are the number of successful events out of the total 
number of trials. If the dataset is from a binomial experiment, there is no dependent variable 
to enter; the program will create it from the events and trials data. 
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Example 4: Logistic Regression 


The F3 menu is for computing the displaying the results of a logistic regression. Option 1, 
LogitQ, is for entering the information and computing the regression. It sets up an input form. 
The first item is to select a dataset. Select “ingot” to be used in this example. Next, enter a 
list of the independent variables. The variables for this example are {h,s}. Next, you are 
prompted to enter the link function; use the logit link function. (The other link functions are 
the Probit and Complementary LogLog functions.) Next, you will be prompted for a fre- 
quency variable. The options are “No” for none, “Yes” for a frequency variable, and 
“Events/Trials” for a binomial experiment. Select “Events/Trials.” The last two options are to 
change the maximum number of iterations and convergence criteria. Leave them at 30 and 
E' 8 for now. Press Enter to continue. 


r FIT Y Y FjT Y FHt Y 

It i .- lu i ■; l-..-. I i 4 ■; .- Itci cl 



Logistic Input Y 


Select dataset ingot* 

1 

2. 

2. 

3. 

4 

1 

Independent Oars: kh,s> 1 

LinkFunc Logit* 

Freq Uan? Eoents-^Tnials* 

Max Iterations: |30 

Converge Criteria: |e - 8I 


Losrit O 

ADYREG RAD AUTD FUNC 7/30 



Having selected “Events/Trials”, you are now 
prompted to enter the events and trials vari- 
ables. For this example they are e and t. 
Press Enter. 


( FIT T FT? Y F3^ Y FHT ' 
|Tools|Honl inear |LogisliG|T5L5| 


Logistic Input 


Events Variable: 
Trials Oar i able: 
, CEnter=QK > 


h: 


<ESC=CRNCEL> 


ITS 9“ 


"FT s 
-A 

^ 2 - 

ZD 2 - 

3. 

4 


Losrit O 


ADYREG RAD AUTD 

FUNC 7/30 


The program will now run and take several minutes to complete. A message will be dis- 
played indicating whether the program was successful in estimating the regression. 


(If you had selected “Yes” for a frequency variable, you would have been prompted to enter 
the name of the frequency variable. Then you would be prompted to enter the name of the 
dependent binary variable. If you had selected “No” for frequency or binomial experiment 
variables, you would have been prompted to enter the name of the dependent binary vari- 
able.) 


Option 2 under the F3 menu, FeqL, displays 
the fitted equation. For this example, it is: 

0.056771 *s + 0.082031*h- 5.55917 


f F1 t Y F Y F3t Y FHt Y s 

|Tools|Honl inear|Logistic|TSLS| | 



1 

16 

27 

4 



3 

13 

51 

1 



0 

1 

51 

2. 



0 

1 

51 

2. 



.0 

1 

51 

4 J 

■ logitO 





□one 

■ feql 

.056771 -s + .082031 ■ 

h - 5.559166 

feql I 

ADYREG 

RAD AUTD FUNC 9/30 




Option 3, OutL, displays a matrix of the parameters, their values, standard errors, and the 
Wald chi-square statistics and probabilities. 
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r FIT Y Y FjT Y FHt ’I 

|Tools|Nonl inear |Logisiic|TSLS 


■ 

logitO 



Done 

■ 

feql 

.056771 ■ 

£ + .082031 

■h - 5.559166 

■ outl 





r "Parn" 

"Ualue" 

"StdErr" 

"UChi2tl>" 


intrcpt 

-5.5592 

1.1197 

24.6502 


h 

.0820 

.0237 

1 1 . 9452 * 


.s 

.0568 

.3312 

.0294 

ms 1 

ADYREG 

RAD liUTD 

FUNC 10/30 l 


"Parm" 

"Value" 

"StdErr" 

"WChi2(l)" 

"Prob(W)' 

intercept 

-5.55917 

1.11969 

24.6502 

6.87383E - 

h 

0.082031 

0.023734 

11.9452 

0.000548 

s 

0.056771 

0.331213 

0.029379 

0.863906 


floolslh 


fT? Y Y FHt T 

|Honl inear |LogistiG|T5L5| 


outl 

"Parn" 

intrcpt 

h 

s 

1 1 ratio 


"Ualue" 
-5.5592 
.0520 
. G568 


' "Chi2" 
.11.542 


"StdErr 11 
1.1197 
.0237 
.3312 
M DF M 


M UChi2t 1 > " 
24.8502 
1 1 . 9452 * 

.0294 

"Prob" 


842820 2 . 000000 . 002983. 


llratiol 


liDYREG RAD AUTU 

FUNC 11/30 


Option 4, LLRatio, displays the -2 log likelihood ratio 
that tests the significance of the covariates, that is, of 
the independent variables taken together. The out- 
put is: 

"Chi2" "DF" "Prob" 

11.6428 2 0.002963 


Chi2 is the chi-square statistic, DF the degrees of freedom, and Prob the probability of ob- 
taining that value by chance. 


Option 5, PrdLogt, computes the logit (or probit or complementary log-log) of “p,” where “p” is 
the probability of the event occurring given a set of values for the independent variables, the 
value of “p,” and the confidence interval of “p.” If h=7 and s=1 , entering PrdLogt({h=7,s=1}, 
.95) produces: 


( FiT T FT? Y Y FHT t - 

|Tools|Honl inear |Logislic|T5L5| 


T. 


.0588 


T "Chi2" 

L 1 1 - 642E 


1 1 ratio 

. 842820 
1 prdlogt( £h = 7 s = 1 > 
'"LogitCprobV 

-4.928180 
, M C. I. CprobV 


3312 .0294 ► 

"DF" "Prob" 1 

2.000000 . 002983J 
.95^ 

M SE M "Prob" ' 

.749868 .007188 

.001662 .030517. 


FrdLofft <<h=7,s=l> , ,95>l 


FUNC 1E/30 


"Logit(prob)" 

- 4.92818 

"C.I.(prob)" 


"SE" "Prob" 
0.749866 0.007188 
0.001662 0.030517 


Reading across, the value of logit(p) is -4.92818, the standard error is 0.74987, and the 
value of p is 0.007188. The 95 percent confidence interval around p is 0.00166 to 0.03052. 
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Two-Stage Least Squares 

Two-stage least squares is the most widely used single-equation method for estimating si- 
multaneous system of equations. Let Y be the endogenous or dependent variable in the 
system and X the exogenous or predetermined variables. The equations to be estimated are 
of the form: 


y = Y 1 *P + X 1 ^ + u 

y is an n by 1 vector of observations on the “dependent” variable. 

Yi is an n by g matrix of observations on the other endogenous variables included in the 
equation. 

P is the g by 1 vector of coefficients associated with Y t 

Xi is the n by k matrix of observations on the predetermined or instrumental variables ap- 
pearing in the equation. 

y is the k by 1 vector of coefficients associated with Xi. 
u is the n by 1 disturbances in the equation. 

The problem of applying OLS to the above equation is that the variables in Y t are correlated 
with u. The essence of two-stage least squares regression is the replacement of Yi by a 
computed matrix Y_hati, where hopefully the stochastic element is purged, and then per- 
forming an OLS regression of y on Y_hati and X 2 . 


The matrix Y_hati is computed in the first stage by regressing each variable in on all the 
instrumental variables in the complete model and replacing the actual observations on the Y 
variables by the corresponding regression values. Thus, 

Yhati = X*(X t *X)' 1 *X t *Yi 

where X = [Xi X 2 ]. X is the n by k matrix of observations on all the instrumental variables in 
the complete model, X 2 being the matrix of observations on those instrumental variables ex- 
cluded from the equation under study. 

In the second stage y is regressed on Y_hati and X 1 . The equation for the 2SLS estimates 
can then be written as: 


-yT * x *( X r * X )-> *7 1 

* 

* 

i 

* 

~b 


-yT *x*(X T *xy' *X T *y 

1 

* 

-K- 

Jr<j 

* 

■jf 

* 

i 


c 


1 

* 

1 


where b is the vector of coefficients on the other endogenous variables and c is the vector of 
coefficients on the predetermined variables in the equation, including the intercept if any. 


The following two examples are models from political economics: 
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Example 5: 

The data used in this example are for a simplified model designed to explain variations in the 
consumption and price of food. The data are from Kmenta, pp. 563-65. 

The variables are: 

q = food consumption per head 

p = ratio of food prices to general consumer prices 

d = disposable income in constant prices 

f = ratio of preceding year’s prices received by farmers to general consumer prices 
a = time in years 

The endogenous (dependent) variables are q and p. The exogenous (independent) variables 
are d, f, and a. 

Estimate the following equations: 
q = yO + pi*p + yl*d (the demand equation) 

q = yO + pi*p + yl*f + y2*a (the supply equation) 


Press F4: 1 (TSLS) ENTER to begin the program. Select the “kmenta” dataset from the pull 
down menu. 

In the equation box enter q = p + d. 

(Note: you do not enter the coefficients for the equation. That is done by the program.) 

In the Endog. Vars. box enter in a list the dependent variables in the dataset: {q,p}. 

In the Exog. Vars. box enter in a list the independent variables: {d,f,a}. 

From the Intercept? pull down menu select Yes for an intercept. 

(Note: According to the SAS statistical software, if the intercept is set to No, the definition of 
the R A 2 statistic for two-stage least squares is changed to 1 - (Residual Sum of 
Squares/Uncorrected Total Sum of Squares.) 

From the VarDef pull down menu select Deg. Freedom with which to calculate the variances. 
(The other option is to select # Obs. for number of observations.) The input form looks like 
this: 


r FIT Y Y F]:t Y FHt Y 'i 

|Tools|Nonl inear |Logistic|TSLS| 


'q 

P 

d 

f 

a ] 


98.485 

100.323 

87.40O 

98 

1 


99. 187 

104.264 

97.600 

99. 100 

2 


102. 163 

103.435 

96.700 

99. 100 

3 


101.504 

104.506 

98.200 

98. 100 

4 


104.240 

98.001 

99.800 

110.800 

5 


1 FiT 

qq 

inn snn 

inn onn 

£ 

kmenta 

ADVREG 

RAD liUTD 

FUhK 




r FIT Y Y FjT Y FHt y 

It t k i i j : It~i ■- 1 


1 1 

¥ TSLS Input 


Select dataset kmenta-* 

a 

1 

2 

3 

4 

5 

Equation: |g=p+d 

Endog. Uars: Kq,p} 

Exog. Uars: Kd,f,a> 

Intercept? Ves-* 

UarDef Deg. Freedom-* 

, CEnter=OK ^ CESC=CnNCEL^ 

TSLSO 

ADYREG RAD AUTD FUhK H/3Q 


Press ENTER to begin the program. 


P 32 


Don Phillips: Nonlinear and other Regressions 


D-N-L#79 


Press F4: 2 (FeqTS) to display the fitted 
equation. 

It is: q = 0.314382*d - 0.243708*p + 94.614861 


103.522 

06.490 

96.400 

110.500 

17 

99.929 

104.016 

104.400 

92.500 

10 

105.223 

105.769 

110.700 

09.300 

19 

.106.232 

■slsO 

113.490 

127. 100 

93 

20. 

Done 


■ f eqts 

g = ■ 514532 ■ d - ■ 2457Q3 ■ p + 94.614361 



Press F4: 3 (OutTS) to display the parame- 
ters, their values, standard errors, t-values, 
and the t probabilities. 



q = . 314382 • d - . 243708 ■ p + 94.614861 
■ out-t-s 


"Pams" 

"Ualue" 

J01 

-.243700 

70 

94.614061 

7 1 

.314302 


"StdErr-" M t(17) M 
. 096077 2.536601 

7.SS7363 11.9957=* 

.046745 6.725433 




■ f eqts 

q = . 314302 ■ d - . 243700 ■ p + 94.614061 


■ out-t-s 

e" "St-dErr" ,l t<17> 11 "Prob(i)" ' 

700 .096077 2.536601 .021200 

^4061 7.007363 11.995753 1.011700E-9 

02 .046745 6.725433 .000004 



Press F4: 4 (ANOVATS) to display the 
analysis of variance table. 


^4861 7.887363 11.995753 1.011780E-9 

82 .046745 6.725433 .000004 

■ anooat-s 



Press F4: 5 to display the R A 2, adjR A 2, and 
SE stats. 
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To compute the other equation run the TSLS program again and just change the equation to 
q = p + f + a. Everything else remains the same. 



The fitted equation is q = 0.252844*a + 0.256024*f + 0.240568*p + 49.448206. 
The other statistics can be displayed as shown above. 


Examr 


This example is based on Klein’s model 1 (1950). The endogenous variables are c, p, w, I, x, 
wsum, k, and y. The exogenous variables are klag, plag, xlag, wp, g, t, and yr. 


yr = year - 1931 
c = consumption 
p = profits 

w = private wage bill 
I = investment 
x = private production 
wp = government wage bill 
g = government demand 


t = taxes 

k = capital stock 

wsum = total wage bill 

plag = profits lagged 

xlag = private product lagged 

klap = capital stock lagged 

y = c + i + g - 1 (national income) 


Estimate the following equations: 

c = p + plag + wsum 
i = p + plag + wsum 
w = x + xlag + yr 

Initiate the TSLS program and select the klein dataset. 


-10 41.9 12.4 25.5 

-9 45 16.9 29.3 

-8 49.2 18.4 34.1 

-7 50.6 19.4 33.9 


-6 


klein 


50.6 19.4 

52.6 20.1 



k 

WSUM 

Plag 

xlag 

klag 

y 

182.6 

28.2 

12.7 

44.9 

182.8 

37.9 

184.5 

32.2 

12.4 

45.6 

182.6 

46.2 

^ 189.7 

37 

16.9 

50. 1 

184.5 

52.5 

192.7 

37 

18.4 

57.2 

189.7 

53.3 

197.8 
a 

38.6 

dn 7 

19.4 

57. 1 

A 1 

192.7 

1Q7 fl 

55.5 

■=i7 

klein 



Enter the first equation. For the endogenous variables enter {c, p, w, i, x, wsum, k, y}. 
For the exogenous variables enter {klag, plag, xlag, wp, g, t, yr}. 

Select Yes for the intercept and Deg. Freedom for the variance definition. 
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14 JIT _L 

1 - ; J-rH’M 


| 1 

f 

TSLS Input 



Select data 

set klein* 



Equation: 

c=p+plag+wsun 

. 9 


Endog. Gars 

: Ic, p, w, i , x, wsum, k, y} 

.2 

4 

Exog. Gars: 

|p lag, xlag, wp, g, t, yr> 

.5 


Intercept? 

Ves* 

.3 


GarDef Deg. 

Freedom* 

.5 


, CEnter=0K 

CESC=CFlNCEO 


TSLSO 


USE * AND * TD DF'EH CHOICES 


r FiT Y F£t Y F3 t Y FHt Y 

|Tools|Nonl inear |Logistic|TSLS| 

% 

199.9 

45.9 

17.3 

65 2G1.3 

53.5 

^ 201.2 

49.4 

15.3 

6G.9 199.9 

6G.6 

204.5 

53 

19 

69.5 2G1.2 

66. 1 

6 2G9.4 

61.3 

21.1 

75.7 2G4.5 

76. sj 

■tslsO 




Done 

■ f eqts 
c = . G173G2 

P + . 

216234 

■plag + . S101S3 ■ | 


1 ADVREG 

RAD liUTD 

FUNC 15/30 




The fitted equation is: 

c = 0.017302*p + 0.216234*plag + 0.810183*wsum + 16.554756. 

To compute the other two equations just change the equation in the TSLS input form. 




TSLS Input 


Select dataset klein* 


Equation: |j=p+plag+klag| 


Endog. Gars: Kc , p , u , i , x , usum , k , y| 


Exog. Gars: Kklag, plag, xlag, up, gj 


Intercept? Ves* 

GarDef Deg. Freedoms 

< CEnter=OK > CESC=CnHCEL^ 


TSLSO 




r F1 t Y F£t Y F3t Y FHt Y 'i 

|Tools|Nonl inear |Logistic|TSLS| 

.5 
.6 
. 1 

.3. 

3ne 


204.5 53 19 ^9.5 201.2 66.1 

^6 209.4 61.3 21.1 75.7 204.5 76. sj 

■ tslsO Done 

■ f eqts 

c = .017302 ■ p + . 216234 plag + .310133 w> 

■ tslsO Done 

■ f eqts 

i = . 150222 ■ p - . 157733 ■ klag + . 615944 ■ p> 


... 43576623#plasr+20- 278208914261 



ADVREG RAD AUTD FUNC 17/30 


The second fitted equation is: 

I = 0.150222*p - 0.157788*klag + 0.615944*plag + 20.278209 


f F1 t Y F£t Y F3t Y FHt Y 

It t Ili i 2 It~i ■- 1 



| l 

f TSLS Input ■> 

— f 

4 

Select dataset klein* 

■ 1 

.sj 

■ t 

■ f 

Equation: |u=x+xlag+yy| I 

Endog. Gars: Kc, p, u, i , x, usum, 

■ k, y| 

3ne 

Exog. Gars: Kklag, plag, xlag, wp, g| 

w> 

3ne 

■ t 

■ f 

Intercept? Ves* 
GarDef Deg. Freedom* 

i 

, CEnter=0K } £ESC=CflNCEL\ 

P> 

TSLSO 

ADVREG RAD AUTD FUNC 17/30 


( FIT T FT? T F3^ T FHt t 

|Tools|Honl inear |Logistic|T5L5| 


1 f eqts 

c = . G173G2 ■ p + . 216234 ■ plag + .S101S3w> 

1 tslsO Done 

1 f eqts 

i = . 150222 ■ p - . 157788 ■ klag + . 615944 ■ p> 

1 tslsO Done 

1 f eqts 

w = ■ 43633Q ■ x + ■ 133339 - xlag - ■ 396365 


I ADVREG 


And the third fitted equation is: 

w = 0.438859*x + 0.146674*xlag + 0.130396*yr + 1.500297 

(Note: Once the programs have been run at least once, all programs and functions in the 
AdvStat folder may be archived. DO NOT archive anything else in the folder. The datasets 
are archived by the AddDS() program.) 

I hope you find the programs useful and enjoyable. I had fun programming them. I have 
also programmed these routines for DERIVE 6.1 . If you would like them, just drop me an 
email. 


Any comments, suggestions, frustrations with the programs? If so, just drop me an email. 


References: 

J. Johnson, Econometric Methods, 2 ed., McGraw-Hill Book Co., New York, 1972. 

Jan Kmenta, Elements of Econometrics, MacMillian Publishing Co., New York, 1971. 

L. Klein, Economic Fluctuations in The United States: 1921-1940, John Wileys and Sons, 
New York, 1950. 

J.K. Binkley and Nelson, G. (1984), “Impact of Alterative Degrees of Freedom Corrections in 
Two and Three Stage Least Squares,” Journal of Econometrics, 24, 3, 223-233. 
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Some additional examples (Josef): 

The following data set is given: 


t 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

V 

2.5 

5.5 

-0.1 

0.9 

4 

1.1 

1.1 

3 

1.5 

2 


I plot the scatter diagram and assume that a damped oscillation - vertically shifted - might 
give an appropriate fit. 

v = a sin(b t)- e' ct + d. 

My initial guesses are: a = 6, b = 2, c = 0.5 and d = 1 . 






□ 

■ ■ ■ □ ■ ■ 

□ 

□ 

■ ■ □ ■ ■ ■ 



■ □ ■ ■ ■ ■ 


□ 

ADVREG RAD AUTD FUNC 


r FIT Y F£t Y F3t Y FHt Y 'i 

|Too 1 s |Non 1 i near |Log i st i c |TSLS | 


f Nonlintor Input 


Select dataset sinuss* 


Equation: u=a*sin< b*t}*e^ - c*t>+ 

Parameters: Ka=6, b=2, c=. 5, d=l> 

Weight variable: 11 

^CEnier=0K ) CESC=CFINCEO 

■ menuO Done 

NonLinO 

ADYREG RAD AUTD FUNC 1/30 


r FIT Y F£t Y F3t Y FHt Y 

|Tools|Honl inear |Logistic|TSLS| 

1 

■ ClrHome 

Done 

■ nonl in() 

Done 

■ f eqn 

u = 4. 7767S2 ■( . 799336)^ ■ sin( 1 ■ 992516 

■t)> 

...155946641*t >+2 .0318864027178 

ADYREG RAD AUTD FUNC 3/30 




Plotting the fit function shows a very satisfying result. 

Next is an example from our textbook “Mathe mit Gewinri ‘ (= "Mathematics with Profif’) m 
vol 4. 

Given is the distribution of ages of football players of English Premier League with a transfer 
sum above 1 Million Pounds. 


Beiapiel: Einer Zeitschrift entnehmen wir die Alter svertei lung der Spieler bei den Transfers in der 
britischen Premier League mit Transfersummen uber 1 Million Pfund: 


Alter 

19 

20 

21 

22 

23 

24 

25 

26 

27 

28 

29 30 31 

32 

% der Transfers 

0 

1 

2,5 

5 

7,5 

14 

17,5 

20 

1 6 

9,5 

4 2 1 

0 


a) Suche eine geeignete Verteilungskurve und 

b) suche eine Kurve fur die kumulierten Werte (% der Transfers bis zum Alter von x Jahren). 


[11 H.D. Hinkelmann a.o., Mathe mit Gewinn 1-4, hpt Vienna 
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The problem is to find an appropriate distri- 
bution curve and an curve for the cumulated 
values (% of the transfers up to an age of x 
years) 



ADVREG FiliD AUTD FUhK 


The form of the scatter diagram gives the idea to try with a bell shaped curve (normal distri- 
bution). See the equation and my guesses for the parameters: 


( FiT T FT? T Y FHT t 

|Tools|Honl inear |Logistic|T5L5| 


Honlintor Input 


Select dataset transf + 

Equation: -b*Cage-c>^2)+d 
Parameters: |=2Q, b=l , c=25. 5, d=0> 
Weight variable 


Dne 

Dne 


■ Pi 

, tEnter=0K 5 

CESC=CFlNCED, 

Dne 

■ nonl inO 

Error: Break 

NonLinO 

ADVREG RAD AUTD 

FUNC E/30 



i 


menui,,i 
1 nonl inO 
1 nonl inO 
1 f eqn 

Pt = 4. 1992E-36-* 


: |fllgebra|Calc|QthenPrgpiIo1ciean Up 


“□5FT 
Error: Break 
Done 

6. 5833 ■ age - . 12S4age> 


■ 4. 1991 e - 36 -e 


6. 5S326 ■ x ■ 


. 128358 ■ 


+ . 3^ 

Done 


■ 8358*x a 2>+-3514469-Kp2 <x> 


The first attempt is successful. 



fithtiY fl tf3 Y fh 'iff y frf i 
' Plot Setup |Ce 1 1 iHeader |Ca 1 g |U i i 1 Isiai 


DATA 







c4 

c5 

c6 

c7 

c3 

1 

0 

.4178 

. 1745 


.5140 

2 

1 

.6725 

. 1073 


.7529 

3 

2. 5G0G 

1 . 5539 

.8950 


1.5934 

4 

5 

3.8357 

1 . 3556 


3.3279 

5 

7.5000 

8. 1614 

.4375 


3.1170 

6 

14 

13.894 

.0112 


13.363 

7 

17.500 

18.518 

1 . 0354 


13.530 


Bi'lc7=5 .9695626182406 


In our textbook we use the Solver of Excel in order to find the distribution curve. Compare the 
results. 


Alter 

% der Transf 

Modell 

SSE 

Graph der Regressionslinie 






19 

0 

0,51406246 

0,26426021 

18 

0,46318172 






20 

1 

0,75294268 

0,06103732 

18,1 

0,46522826 






21 

2,5 

1,59833618 

0,81299764 

18 








22 

5 

3,82780734 

1 ,37403564 

18 


Altersverteilung bei Transfers 


23 

7,5 

8,11685939 

0,3805155 

18 



24 

14 

13,8626892 

0,01885425 

18 




25 

17,5 

18,5303933 

1,06171036 

18 

25 






26 

20 

19,2285086 

0,59519895 

18 




27 

16 

15,4770061 

0,27352258 

18 

2 20 - 


* 


28 

9,5 

9,71537976 

0,04638844 

18 

£ 

/ \ 


29 

4 

4,8526728 

0,7270509 


to 

HI- 

/ \ 


30 

2 

2,06346327 

0,00402759 

19 

£ 15 - 

d \ 


31 

1 

0,9076181 

0,00853442 

19 

a 3 

T \ 


32 

0 

0,55239875 

0,30514438 

19 


/ \ 



+Z 1 u 





SSE= 

5,93327817 

19 

S 

IM 

* \ 






19 

Ci 

Jr r 

/ \ 


a 

b 

c 

d 

19 

CL □ - 

/ 4^ 


19,0852513 

0,130403987 

25,6452902 

0,45383993 

19 








19 

0 -i 

W 1 1 

1 • 






19 

15 

20 

25 

30 

35 


See next pages: Don's tools provided for DERIVE and TI-NSpireCAS 
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#16: [Predi'cted_Value5([2Q10]) , Predi'cted_Value5([2020]) ] 


#17: 


9S% Confidence Interval 
Value 341.94977 

Se_yhat/Se_YO 10.463537 11. 158423 

CI_yhat 323.68166 360.21787 

CI_Y0 322.46847 361.43106 

Source DF SS 

4 


#18 : anova 


MS 


9S% Confidence Interval 
Value 387.54825 

Se_yhat/Se_Y0 13.764359 14.299733 

CI_yhat 363.51730 411.57920 

C I_Y0 36 2 . 58 26 0 412 . 5139 0 

F Pnob(F) 


#19 : 


Reg 2 7.1682365-10 3. 5841182-10 2385.4704 0 

Error 16 240.39657 15.024786 

4 

Total IS 7 .1922761-10 

REST([pop4,4.2, popul]’) 

-12 9 12 

203.55302- (ERF(4. 9268136-10 ■ (1,812418-10 -x - 3.5747336-10 )) + 1) 

-8 0.0115 26649 -x 

3,2359135-10 -e - 31.280723 



Applying the Marquardt-Levenberg algorithm gives the same result: 

b- (y - 1790) 

#20: HARQUARDT(p = c-e + a, [a, b, c], [10, 0,03, 20], pop) 

-8 0. 011526663 -y 

#21: faq = (p = 3,2358222-10 -e - 31,280608) 


Next example tries a trigonometric fit of temperature values. 
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# 22 : 

#23: 

#24: 



" X 0.3 1.3 2.3 3.3 4.3 

3.3 

6.3 

7.3 

8.3 

9.3 

10.3 11.3 " 

temp 

. y -3.7 -3.7 1.7 8 14.7 

15.8 

21.8 

15.6 

14.2 

7 

0.6 -2.9 . 

GAUSS_NEl#T0N(y = a-SIN(b-x + c) + d, [a, 

i b, c 

i d], 

[13.3, 

1, 1, 

S-B] 

■ temp) 


feq = (y = 13 , 353080- SIN (0, 53176 799- x + 4,4243910) + 8.1265919) 



Marquardt-Levenberg did not work! 

Sometimes it is necessary to try some different guesses of the initial values for the parameter 
in order to receive a satisfying answer: Once more the age-transfer model from page 32: 


#34: transf 


#33 : 


G AU SS_N EWTO N Lt = bO-e 


21 22 

23 

24 23 

26 

27 28 

29 30 

31 

32 ' 

i 

2.3 3 

7.3 

14 17.3 

20 

16 9.3 

4 2 

1 

0 . 



2 








bl- (a - 

b2) 

0 

JO 

1 1 

m 

_□ 

-1- 

. bi, 

i — i 

m 

JO 

L"-J 

JO 

. [20, 1, 

LU 

O 

1], 

transf^ 


#36: -93 14.047781-a - 0.22487613-a 

feq = 9 , 2273392 - 3. 0942497-10 -e 


#37: 


GAUSS_NEWT0NU: = b0-e 


- bil- (a - b2) 


+ b3 , [bO, bl, b2, b3], [20, 0.1, 30, 0.1], transfj 
2 


#38 : -36 6.6884918- a - 0.13040384- a 

feq = 1. 0808899-10 -e + 0.43382339 
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This is a more complicated function to approximate. In DNL#64 I presented a “Traffic Density 
Application”. In DNL#64 I used the slider bars to find a function composed of two bell shaped 
functions. Now let's try with Don's tools and compare the results: 


#45 : 


#46: 


#47: 


r h - 

tabvz := 

L d_ 


GAU5S_NEWTQN 



0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

U. 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

100 

75 

50 

125 

290 

410 

610 

860 

1000 

990 

810 

620 

580 

500 

475 

550 

850 

900 

600 

475 

300 

240 

150 

120 

100 


2 2 
- C(h_ - bl)/b2) - ((h_ - b4)/b5) 

,d_ = bO-e + b3-e + b6 , [bO, bl, b2 , b3 , b4 , b5 , b6], [1000, 9, 3, 900, 17, 3, 1], tabv; 

2 2 'l 


1. 374721- h_ - 0. 08044227- h_ 

2 , 5550863-e 


-11 3. 643272 ■ h_ - 0.10924957 ■ h_ 
+4.6785709-10 -a 



MARQUARDT delivers the same result as Gauss-Newton! 

Comparing the Sum of Squared Errors (SSE) gives 6.0314 • 10 4 for Gauss-Newton and 
1 .1293 • 10 5 for the manually performed “Slider Bar Approximation”. 

Don provides the Two Stage Approximation for DERIVE, too This is the documentation fol- 
lowed by the two Tl-examples which are shown above: 

The program 

TwoStage(eq_, endog_, exog_, data_, i ncept_:=l , vardef_: =1) 

computes the 2SLS regression coefficients, standard errors of the coeffi- 
cients, t values and their probabilities, the AN0VA table, root MSE, 
R_square, and adjusted R_square. 

The inputs are: 

eq_: the equation to be solved for 

endog_: a vector of the endogenous variables 

exog_: a vector of the exogenous variables 

data_: the name of the dataset where the first row of the data matrix 

contains the names of the variables 

incept_: the default of 1 indicates that an intercept is to be computed 
for the equation; set incept:=0 if you do not want an intercept. 
vardef_: the default of 1 sets the variance denominator to the degrees of 
freedom; changing vardef_ to 0 sets the denominator to the number of ob- 
servati ons . 

Note : According to SAS, if the no intercept option is set (incept_=0), 
the definition of the R_square statistic for two-stage least squares is 
changed to 1 - (Residal Sum of Squares/Uncorrected Total Sum of Squares). 
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In addition, the program 

Model (mm_,endog_, exog_,data_,i ncept_:=l, vardef_:=l) 

solves for all the of equations at once, mm is a matrix of the equations 
with one equation per row. 


#9: 



q = p + d 


Model 

\ 

.. q=p+f+a. 

i [q. p] i [d i f, a], data 

J 


q = - 0. 2437076898- p + 0 . 3143821013 • d + 94.61485336 
q = 0. 240567722- p + 0 . 2528436233 • a + 0 . 2560236611- f + 49.44820072 


I tried to produce a 3D presentation of the q(p,d) model. I am satisfied with the result. (Josef) 
plot := VECTOR ([pt] , pt, REST([data,i 42 , data.143, data^l]’ )) 
z = - 0,2437076898.x + 0, 314 38 21013- y + 94,61485336 



And this is the second example from above performed with DERIVE (with and without inter- 
cepts): 



s' 

c = p + plag + wsum 


Model 


i = p + plag + klag 

i [c, p, w, i, x, wsum, k, y ] , [klag, plag, xlag, wp, g, t, yr], klei'n 



w = x + xlag + yr 

J 


c = 0. 01730294013- p + 0.8101827436-wsum + 0.2162334038-plag + 16.55475199 
i = 0.1502217351-p + 0 . 6159436535 • pi ag - 0. 1577876492- klag + 20.27821173 
w = 0 . 438858637 - x + 0. 1466742262 -xlag + 0.1303957913-yr + 1.500299135 



f 

c = p + plag + wsum 


Model 


i = p + plag + klag 

, [c , p, w, i, x, wsum, k, y], [klag, plag, xlag, wp, g, t, yr], klei'n, 0 



w = x + xlag + yr 

J 


c = 0.1583375706-p + 1 . 121162681- wsum + 0.2651214661- pi ag 
i = 0. 4459657099 -p + 0 . 3685120348- pi ag - 0 . 06148091416- kl ag 
w = 0.4444584525-x + 0 . 166299164- xl ag + 0.1129951702-yr 
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Don has no problems to transfer his programs from DERIVE to the Tl-handhelds (V200 & 
TI-89) but also to TI-NspireCAS. File Phillps_AdvReg.tns for TI-NspireCAS provides Nonlin- 
ear Regression for Tl-Nspire. I print two screenshots because this might be of special inter- 
est for us teachers when showing the students how to fit a power function by transforming it 
to a linear function. We know that this is not quite correct because the linear regression is 
applied on transformed data. This is what Don has to tell - and to demonstrate - us: 


Linear vs* Nonlinear Regression 

In some cases a nonlinear function can be linearized through a transformation of the data. While this usually gives 
reasonable results, sometimes it may not. The reason is that a linear regression with transformed data does not 
minimize the sum of squared errors of the non-transformed data. The data in spreadsheet 1 .8 is from a project fitting 
a power cun/e to the data. The curve was originally estimated with Excel. 

gaussnewton(ln(r_)=fl+A i ln(m_) J {fl=l J fi=l },1,3G,1Q,10 _S J 

This is a natural log transformation of the equation r_=a*m_ A b. The output is in 1 .9. The fitted equation is solved for 
r_, and a=1 8. 31 9 and b=— 0. 3535. This is also the result that the Tl-Nspire produces for the Power Regression (see 
1 .8). In other words, it is not a true power regression; it is a linear regression of a log transformed equation and data. 

The untransformed equation is also fit.] 
gaussnewton(r_=a , { a=16,£=-0.35 }, 1,30, 10, 10 _d 

Here a =57. 821 and b=— 0. 52927. The graphs in 1 . 1 0 show both equations. It is up to the analyst, of course, to 
determine which fit is best for her purposes. 



